Untangling the interplay between epidemic spreading and 
transmission network dynamic 
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Abstract 

Epidemic spreading of infectious diseases is ubiquitous and has often considerable impact on public 
health and economic wealth. The large variability in spatio-temporal patterns of epidemics prohibits simple 
interventions and demands for a detailed analysis of each epidemic with respect to its infectious agent and 
the corresponding routes of transmission. To facilitate this analysis, a mathematical framework is introduced 
which links epidemic patterns to the topology and dynamics of the underlying transmission network. The 
evolution both in disease prevalence and transmission network topology are derived from a closed set of 
partial differential equations for infections without recovery which are in excellent agreement with comple- 
mentarily conducted agent based simulations. The mutual influence between the epidemic process and its 
transmission network is shown by several case studies on HIV epidemics in synthetic populations. They 
reveal context dependent key processes which drive the epidemic and which in turn can be exploited for tar- 
geted intervention strategies. The mathematical framework provides a capable toolbox to analyze epidemics 
from first principles. This allows for fast in silico modeling - and manipulation - of epidemics which is 
especially powerful if complemented with adequate empirical data for parametrization. 
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In spite of huge efforts to improve public health the spreading of infectious diseases is ubiquitous at the 
beginning of the 21st century with considerable variability in epidemic patterns among locations. Although the 
current influenza pandemic is a global challenge there are nonetheless differences in its timing in the northern 
and southern hemisphere due to seasonal effects (Hill- Another prominent example for epidemic variability 
is the prevalence of sexually transmitted diseases (STDs) and specifically HIV infections. While HIV is in 
many populations endemic at low levels or restricted to high risk groups it has become pandemic in other parts 
of the world [3] HI- In consequence, the spreading of infectious diseases cannot be understood globally but 
is always the result of many local factors such as climatic and hygienic conditions, population density and 
structure, cultural habits and mobility - to name only a few which in themselves are often interconnected and 
time dependent. Epidemic models aim to capture the mechanisms that link these factors to the epidemics that 
one eventually observes and to promote an understanding of the underlying dynamic processes as a prerequisite 
for intervention strategies (Hill. A useful abstraction in this context is to regard persons which may be infected 
as nodes of a network in which the links are the potentially infectious contacts among persons (or nodes in 
network notation). This simplification reveals one major cause for the large variability in epidemic processes: 
The structure and dynamics of this transmission network is highly context dependent: it depends on the routes 
of transmission and infectious profile of the pathogen, the behavioral patterns of its host as well as a number of 
other co-factors. 

It is therefore important to develop models that can be flexibly adapted to specific epidemic situations. As 
our focus will be to study the interplay between transmission network topology and epidemics, we will restrict 
ourselves to diseases caused by agents which lead either to immunity or death in their host, i.e. in which 
infection can occur only once. These epidemics can be described by Susceptible-Infected-Recovered or SIR 
models J5]|6l. Whereas we refer to the mathematically closely related case in which an infection eventually 
leads to the death of the host as SID model (Susceptible-Infected-Death). The classical SIR model describes 
epidemics in homogeneous and well mixed populations which is not generally a good approximation to the 
real world situation. Therefore, current epidemic models strive for a better consideration of the transmission 
network structure in order to make more realistic predictions EKH. Compartmental models consider different 
contact patterns in sub-populations and link them via a contact matrix J9). Network approaches go further in 
directly considering the distribution in each person's number of infectious contacts k (i.e. each node's degree 
k in network notation) in the transmission network iflOl [TT1 [121 . These models allow to study transmission 
networks with strong heterogeneity in the number of contacts among persons which in some cases also consider 
correlations in the way contacts are made lfT3lPT4l or clustering fl31 . While these approaches focus on static 
networks, a recent approach considers networks with arbitrary degree distributions and transient contacts and 
allows to derive the number of susceptible and infected nodes from a closed set of equations lfl6l [TTl . Pair 
models are a very general approach [ 18 ] to consider epidemics on heterogeneous networks which provide a lot 
of flexibility to consider correlations or clustering in the way contacts are made 11191 l20l but as a trade-off they 
quickly become computationally very demanding lf2TTl . 

Another assumption often implicitly made in epidemic models is that the epidemic sweeps through the 
population at time scales much shorter than the time scale of the background demographic process, i.e. natural 
birth and death processes are neglected. This can be considered a good approximation in cases such as the 
yearly influenza epidemics but it is hardly adequate for HIV epidemics which span decades. Taking motivation 
out of this shortcoming we further develop recent network epidemic models lfl6l [171 to consider demographic 
background processes. Having HIV as a case study in mind we focus on epidemics of diseases that lead to 
death after infection of susceptible persons, possibly after undergoing several stages of disease. Different from 
earlier work, our approach will also allow for an in detail study of the interplay between epidemic spreading 
and the structure and dynamics of the underlying transmission network. 
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1 Introduction of the model 



We will first introduce the mathematical framework for a simple SID model with one infectious stage I before 
death D and study its basic properties. This will then be extended to a more general SI^D model with two 
stages of disease Ij and I 2 before death. 

1.1 The SID model 

Following the ansatz of |fT6l[T71 , heterogeneity in the number of infectious contacts of each person is introduced 
by splitting the population into subgroups according to their number of infectious contacts. For each of these 
subgroups evolution equations can be written down that take into account the effects induced both by the 
epidemic and the demographic background processes. 

With the notation of Tab. [T]the equations for the numbers of susceptible and infected persons with k contacts 
S k and Ik read: 

S k = —rpsikSk new infections 



Note that implicitly is assumed that individuals who newly enter the population have k contacts with prob- 
ability p k and link randomly to those persons already present. People dying from natural causes are assumed to 
have the same average number of contacts as found in the whole population without preferences for susceptible 
or infected persons. In terms of the total number of susceptible S and infected persons / the equations read 



To close this set of equations additional equations have to be derived for p$i and pjj, the probabilities that 
a contact made by a susceptible or infected person points to an infected person. To capture the evolution of 
the network topological features the time evolution of the probability generating functions (PGF) gs(x, t) and 
gi(x, t) have to be calculated which represent the degree distributions of susceptible and infected persons. That 
means we have to follow the way susceptible and infected persons are linked and how the topological features 
of the transmission network change within these subgroups in response to the epidemic. To derive psi and pn 
we follow the techniques developed in |[T6l (cf. supporting information for detailed calculations) and conclude 
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+rjNp k - r]S k 
+r l g , (l,t)(S k „ 1 -S k ) 
-rj(kS k - (k + l)S k +i) 
-fip S i(kS k - (k + l)£fc + i) 
+rp S ikS k 
-(r ? + /z)I fc 
+r/5 / (l,t)(/ fe _i -I k ) 
-ri(kl k - (k + l)I k+1 ) 
-fip n (kl k - (k + l)4+i) 



natural birth and death 

contacts made with new nodes 

contacts lost from dying nodes 

contacts lost from nodes dying from infection 

new infections 

death 

contacts made with new nodes 
contacts lost from dying nodes 
contacts lost from nodes dying from infection 



S = r/N — rpsiMs — r]S 
I = rpsiM s - (77 + n)I. 



(1) 
(2) 



Psi 



Pn 




(3) 



(4) 
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Table 1 : Notation of the model* 



A k ( A = Ak ) number of persons in group A with k contacts (and 

total no. in A) 

Nk = Y1,A Ak (N = Ylk Nk ) number of persons with k contacts (and total no. of 

persons) 

(fc) jr-. ,\ 

PAk = — = 9a fc! ' probability for a person in group A to have k con- 

tacts 

gA(x, t) = J2k PAk{t)x k probability generating function (PGF) of PAk(t) 

(k)A = 5^(1) t) average number of contacts of A persons 

p k = jJf- probability for a person to have k contacts 

9{x, t) = Y,kPk{t) xk = Ea t^9a(x, t) probability generating function (PGF) of p k {t) 

(k) = g'(l,t) average number of contacts 

ph probability for a person entering the population to 

have k contacts 

g(x, t) = J2kPk(t) xk probability generating function (PGF) of pk{t) 

Ma = Yl,k kAk = Ag' A (l, t) ( M = J2 A Ma) number of links coming from A persons (and total 

no. of links) 

Mab number of links coming from A persons and point- 

ing to B persons 

P AB = M Ma probability for a link starting form an A person to 

point to a B person 

*Note that A, B correspond to the stages that are passed during an infection, e.g. S for susceptible, 
/ for infected etc. Throughout the manuscript derivatives with respect to time/spatial variables are 
denoted by a dot/prime. 



The occurrence of the probability generating functions in equations © and (@} makes obvious that the way links 
are maintained between susceptible and infected nodes depends on the contact behavior within these subgroups 
as well as the changes therein, or in other words, the time evolution of the PGFs g$(x,t) and gj(x,t) (via 

g' s (l, t), t) and M A = Ag' A (l, t), A G {5, /}). Considering g A {x, t) = J2 k (4f ~ %PAk) x k the set of 
equations can be closed with 

gs(x,t) = -rp S i(xg' s {x,t) - g' s (l,t)g s (x,t)) 
N 

+V-g{d{x,t) -g s (x,t)) 
-r](l-x)g'(l,t)gs(x,t) 

+(v + f^Psi){l-x)g' 3 {x,t) (5) 
S 

gi(x,t) = rp S ij(xg' s (x,t) - g' s (l,t) gi (x,t)) 

-r](l-x)g'(l,t)gi(x,t) 

+ -x)g'j(x,t). (6) 

With the equations given in Tab. [J we are able to investigate the interplay between epidemic processes and 
transmission network topology. In particular, it is possible to follow the full degree distribution within the sub- 
groups of susceptible and infected persons in terms of the probability generating functions gs(x, t) and gi(x, t). 
However, transmission networks do not only change their structure as a result of birth and death processes but 
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Poisson network Scale free network Poisson network Poisson network 

T) = 0, p=0 r|=0, p = r| = 0.01,p = T| = 0, p = 0.2 




Figure 1: Evolution in the numbers of susceptible (green) and infected (red) persons (top panel) as well as 
their average number of contacts per person (bottom panel). The doted, light colored curves correspond to 
the result of 100 agent based simulations, the solid lines are the results of the numerical solution of the set of 
partial differential equations (Q]|6]), parameters are chosen analogous to lfl6l . average number of contacts (k) 
= 3, transmission rate r = 0.2, recovery rate \i = 0.1. Epidemics in networks with differences in heterogeneity 
and transience in contacts are shown: The first column shows a static Poisson network with degree distribution 

Pk = jp as opposed to a network with a scale free degree distribution p^ = / £ _i\ in column 2 (same 

Li 7 ( e re ) 

average degree), column 3 corresponds to the network in column 1 with an additional demographic process 
(birth and death at a rate r] = 0.01), column 4 corresponds to column 1 with the additional feature that contact 
partners change at a rate p = 0.2. 



also due to changes in contact partners. Analogous to ifTTl we consider swapping of contact partners at a rate p 
which affects the quantities p$i and pn via an additional term p — psi\ and p (jjf- — pjA in equations 
© and ((U), respectively. 

This approach allows to study and analyze epidemics on transmission networks with a broad range of topo- 
logical features. In particular, any degree distribution which represents the contact behavior within a population 
can be implemented within the formalism by providing the corresponding probability generating function as 
an input parameter. Figure Q] shows the excellent agreement between the predictions derived from the set of 
partial differential equations (Q]|6]) and the observations from agent based simulations for several exemplary 
topologies. The approach reproduces in particular well known behavior: Epidemics in heterogeneous networks 
spread faster but are more restricted (Fig. [T] column 1 vs. 2) 11221 . Contrarily, transient contacts lead to a wider 
spread of disease as long as contacts are maintained sufficiently long to ensure transmission (Fig. [Q column 1 
vs. 4). The re-growth in the average number of contacts per person (node) in the scenario with birth and death 
processes (Fig. [Q column 3) is a reminiscent of re-emergent or persistent infections that can be observed in this 
context. 
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1.2 The SI^D model 



Following the intention to model HIV epidemics we have to extend the model to take the heterogeneous in- 
fectious profile of an HIV infection into account. The course of disease is characterized by a short but highly 
infectious period of primary infection. This is followed by a prolonged period of latent infection with a much 
lower infectivity (sometimes also referred to as asymptomatic or chronic phase) [23 ] before the onset of AIDS 
as a final stage of disease. In order to understand HIV epidemiology it is therefore important to consider the 
different stages of disease. Our focus will be on the primary infection I\ which ceases at a rate u\ and is associ- 
ated with a transmission rate r\ as opposed to the latent infection I2 with parameters fj,2 and T2- We neglect the 
role of the final stage of disease in transmission assuming that health conditions will be in conflict with a further 
transmission of HIV. The evolution equations for S^, I\k and I 2 fc are determined analogously to the SID case 
to describe the numbers of person with k contacts being susceptible, primarily infected and latently infected. In 
addition, equations describing the contacts among persons of different epidemic groups are derived (i.e. psh, 
Psi 2 > Piih> Phh an d Phh)- Finally the set of equations is closed by a derivation of the probability generating 
functions gs(x, t), gj 1 (x, t) and gj 2 (x, t) which describe the contact patterns and their temporal changes within 
each group (for details cf. supporting information). The resulting equations may look cumbersome but allow 
for a much faster (and flexible) assessment of epidemic scenarios than agent based simulations. Again it be- 
comes clear in the mathematical structure of the equations that the epidemic process cannot be decoupled from 
the underlying transmission network and that there is a mutual influence between both. 

This is exemplarily illustrated in Fig. |2]which shows the spreading of HIV in a "synthetic" population with 

_h 

a scale free distribution in the number of potentially infectious contacts. (p& = k Z e _ j\ , 7 = 2.4, tz = 20, 



average number of contacts (k) = 1.6) to which persons are born at a rate of 771 = 0.02 per year and die at a rate 
772 = 0.015 per year. Transmission and progression rates of HIV in the primary and latent stage of disease r\, 
r<i, £ti and [12 are estimated from 11231 . Agent based simulations and the numerical solution of the set of partial 
differential equations agree well and show how the epidemic saturates within a few decades with a few percent 
of latently infected individuals (as opposed to a few per mill of primarily infected individuals). During the epi- 
demic expansion phase the average number of contacts among infected persons grows shaiply while the average 
number of contacts among susceptible persons decreases slightly. Persons with more potentially infectious con- 
tacts are at a higher risk of infection and accumulate within the population of infected persons. This hierarchy 
in the average number of contacts from primarily infected over latently infected to susceptible individuals can 
still be observed during the saturation phase of the epidemic. The temporal evolution in the network topology 
can be studied in more detail from the probability generating functions gs(x,t), gi 1 (x,t) and gj 2 (x,t) of the 
degree distributions describing the contact behavior within the epidemic subgroups in Fig. [2] (right panel). The 
contact behavior of individuals newly entering the population is not time dependent and corresponds to original 
distribution found before the onset of the epidemic (g(x, t) = g(x, 0) = const). Nonetheless, the contact 
patterns in the population change specifically in the epidemic subgroups. Considering that <ju(0) t) = pao is 
the probability to find a person without infectious contacts in each group one can see that the fraction of single 
persons grows among susceptible persons as it does among latently infected persons in the saturation phase of 
disease due to the increased death rate among their infected contacts. This loss in contacts is not observed in the 
short period of primary infection. It corresponds to the maintenance of the high average number of contacts in 
this subgroup (originally acquired due to the hazard of infection growing with the number of contacts). The full 
probability distribution for the observed numbers of contacts per person in each group can be derived from the 

shown probability generating functions gA(x, t) via PAk{t) = A fc! ' ■ The changes in structure of the PGFs 
among the subgroups become even more obvious if we start with a more homogeneous topology as for example 
a network with a Poissonian degree distribution. 

Note that a persistent epidemic would not be equally observed if new persons enter the population with 




5 




10 20 30 40 50 60 10 20 30 40 50 60 10 20 10 20 

t [years] t [years] t [years] t [years] 



Figure 2: Evolution in the numbers of susceptible (green), primarily (red) and latently (orange) infected persons 
(left) as well as their average number of contacts per person during the epidemic (middle). The doted, light 
colored curves correspond to the result of 20 agent based simulations, the solid lines are the results of the 
numerical solution of the set of partial differential equations. Note that logarithmic scale was chosen to present 
the different orders of magnitude in the size of the epidemic subgroups. The epidemics takes place on a scale 
free network with average degree (k) = 1.6 into which persons are born at a rate r\\ = 0.02a -1 and die at a rate 
r/2 = 0.015a -1 . The epidemic parameters are chosen in accordance with the infectious profile of HIV 11231 . i.e. 
transmission rates in the stages of primary and latent infection are r\ = 2.76a -1 and ri = 0.1a -1 with rates of 
progression fj,± = 4.1a -1 and fi2 = 0.12a -1 . The right panel shows the evolution of the probability generating 
functions over 20 years of the epidemic representing the distribution in the number of contacts in persons newly 
entering the population (g(x,t) = const.) as well as in susceptible, primarily and latently infected persons. 
The contour plots interpolate from (dark colors) to 1 (light colors) in steps of 0.1. 



the currently found contact behavior instead of the initially found contact behavior, i.e. with g(x, t) = g{x, t) 
instead of g{x, t) = g{x, 0) = const. As high risk individuals have a higher risk of death due to the epidemic 
this will lead successively to an introduction of individuals with lower risk behavior (lower average number of 
contacts) until eventually the network becomes sub-critical and the epidemic ceases. 

2 Transmission by stage of disease 

The analysis has already taken into account that the course of an HIV infection is intimately related to the 
dynamics of its epidemic spreading: While the brief period of primary infection is associated with a largely 
increased infectivity one observes a much lower infectivity during the prolonged period of latent infection 
(sometimes also referred to as asymptomatic or chronic infection) which grows again in the late stages of disease 
ll23l l24l 1231 l26l . Therefore it cannot trivially be decided from which phase of disease most new infections 
emanate making this a topic of ongoing debate 11271 l28l . There are various studies on the contribution of the 
initial and latent stage to HIV incidence in different populations (i.e. in largely heterosexual populations 11291 
and homosexual populations ll30l ). They reveal that the contribution of either stage of disease to HIV incidence 
is very context dependent. There is however agreement that primary infection becomes a more important 
driver of the epidemic when risk behavior increases (number of casual/concurrent contacts or partners) whereas 
incidence from latent infections becomes more important in the saturation phase of an epidemic (as opposed to 
its expansion phase) which is supported by modeling approaches lOTll32l . 

This question can certainly not be answered conclusively by our study due to lack of adequate empirical 
data but we can provide a tool which helps to better understand under which circumstances infections dominate 
either from primarily infected or latently infected persons. Therefore we do not aim to parametrize the model 
in the most realistic way but investigate two possible scenarios which consider some general features found in 



6 




Figure 3: Evolution in the numbers of susceptible (green), primarily (red) and latently (orange) infected persons 
as well as their average number of contacts per person in the scenario of strong and weak concurrency (left and 
middle column). Note that logarithmic scale was chosen to present the different orders of magnitude in the size 
of the epidemic subgroups. The epidemics take place on random networks with the sketched distributions in 
the number of contacts (po to pe are shown, p& = for k > 6) into which persons are born and die at a rate 
Vi = V2 = 0.02a -1 . The epidemic parameters are chosen in accordance with the infectious profile of HIV ll23l . 
i.e. transmission rates in the stages of primary and latent infection are r\ = 2.76a -1 and r2 = 0.1a -1 with 
rates of progression /ii = 4.1a -1 and fi2 = 0.12a -1 . The right column (comparison, top) shows the fraction 
of persons in the primary (red) and latent (orange) stage of disease in the scenarios of strong concurrency (solid 
lines) and weak concurrency (dashed lines) for comparison. The final diagram shows the incidence ratio for 
both scenarios, i.e. the fraction of the number new infections from the primary over those from the latent stage 
of disease (strong concurrency - solid line, weak concurrency - dashed line). 



HIV epidemic networks via sexual transmission. There is generally large heterogeneity in the numbers of sex 
partners ll33l l34l l35l as well as in the frequency of partner change and the level of concurrency in partnerships 
[36]. Acknowledging that not only the number of infectious contacts but also their timing 071 l38l [39l is 
important for epidemic spreading we investigate two scenarios that we depict as weak and strong concurrency 
with parameters as given in Fig. [3] Both scenarios assume an average number of 10 lifetime partners during 
a lifetime of 50 years (rj = 0.02a -1 , neglecting delays due to childhood/adolescence before sexual debut). 
However, in the scenario of weak concurrency a lower average number of concurrent partners (k) is traded in 
for a higher partner change rate p in comparison to the scenario of strong concurrency. 

The results from the numerical solution of the set of PDEs is shown in Fig. [3] It gets immediately apparent 
that not only the number of partners but also the level of their concurrency has a profound impact on both 
the time scale and width of epidemic expansion. One observes a much more severe epidemic in the case of 
strong concurrency. A common feature of both epidemics is the hierarchy in the average number of contacts 
for primarily, latently infected and susceptible persons giving directions for targeted intervention strategies. To 
understand which stage of disease drives the epidemics it is specifically of interest to study the ratio in the 
number of infections that are derived from individuals in their primary and latent stages of disease, i.e. the 
incidence ratio 



r2PSh 

which is shown in the bottom right panel of Fig. [3] While infections from the primary stage of disease dominate 
in the expansion phase of the epidemic in a scenario with strong concurrency in contacts, infections from the 
latent stage of disease dominate as soon as the epidemic matures. In the case of weak concurrency (being 
a closer approximation to serial monogamy) infections from the latent stage of disease dominate throughout 
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the whole epidemic. This case study confirms that the question which stage of disease drives HIV epidemics 
cannot be answered without profound knowledge of the topology and dynamics of the underlying transmission 
network as well as knowledge about the saturation stage of the epidemic. 

3 Discussion 

A highly flexible mathematical framework has been introduced which allows to investigate the interplay be- 
tween the topology and dynamics of the transmission network and the epidemics of a corresponding infectious 
agent within a closed set of equations. The approach has been focused on pathogens which lead to death of 
their hosts after some time of infection (possibly in several stages). HIV epidemics have been considered as 
an area of application in which the newly developed method helps to understand the complex interdependen- 
cies between the HIV epidemic profile, its transmission network, and epidemic process. Several scenarios in 
synthetic populations have been investigated showing that the transmission network is not a static support that 
shapes the epidemic but to the contrary is itself shaped by the epidemic. This becomes manifest in the changing 
contact behavior of infected and susceptible persons quantified in terms of the probability generating functions 
of their degree distributions. This interplay can become even more intricate in the presence of a multi-stage 
infectious profile as observed for HIV. Still, the mathematical framework provides a capable tool to derive con- 
text dependent answers to the question which stage of disease is the major driver of the epidemic. Together this 
allows for a context sensitive identification of key epidemic processes and (sub-)groups that have to be targeted 
for maximum impact. The HIV case studies emphasize that it depends both on the maturation stage of the 
epidemics and the structure of the relevant transmission network on which groups intervention strategies (i.e. 
for example promotion of behavioral changes or vaccination programs if available) should focus. 

The current presentation gives only one example for the application of the developed method which can be 
extended to other settings with a more complex infectious profile or to epidemics with a classical SIR dynamic. 
This makes the approach useful to a very broad spectrum of epidemic scenarios which may include improved 
modeling of SIR-like infections such as measles, rubella, pertussis or influenza. The broad applicability of the 
approach makes it worthwhile to consider further improvements which stretch the limits of the model towards 
an increasingly realistic description of the epidemics we face day-to-day. Fig. [T]and Fig. [2]already indicate that 
the current approach is designed for the limit of infinite population sizes and does not account for the obvious 
fluctuations seen in the finite size agent based simulations. However, recent research has shown that stochastic 
fluctuations may have a strong influence on real world epidemic phenomena such as re-emergent epidemics l40l 
|4T1 making this an exciting direction of future research within the given mathematical framework. It is further 
assumed that contacts are made randomly fl2l without taking into account any preferences or correlations 
influencing their establishment. Social networks usually show clustered communities H2ll and some degree of 
assortativity, i.e. persons tend to mix with their likes. ll43l[T3l l9l. This often results in the generation of core 
groups which sustain and drive an epidemic and which will have to be considered in an extended model. 

In conclusion, we have presented a mathematical framework that allows to closely monitor both the epi- 
demic process and its transmission network for general SIR-like infections in an computationally efficient way. 
The current method allows for great flexibility to account for variability in transmission network topology and 
dynamics as well as pathogen specific features. Nonetheless, it will be important to assess the method's limi- 
tations in the field after parametrization with appropriate empirical data - and an exciting challenge for future 
research to expand its limits further. 
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4 Materials and Methods 



Agent based simulations were done with NetLogo V4.0.4 R41 . in which some code fragments were used from 
the model "Virus on Network" which is included in the software's model library B31 . Poisson networks were 
generated by assigning (k)N/2 links between randomly picked nodes, other random networks were generated 
on the basis of their degree sequence B6l[T2l . The numerical solution of the partial differential equations was 
done with Mathematica V6.0.2 B71 using the function NDSolve and the numerical method of lines ||4"8"1 . 



Acknowledgments 

I would like to thank D. Hollingsworth for the hospitality and inspiring discussions during my visit in the 
Department of Infectious Disease Epidemiology at Imperial College, London, J. Lower for the overall support 
of the project at the Paul-Ehrlich-Institut, and A. Bunten for his feedback on the manuscript. 

References 



WHO (2009) Pandemic H1N1. http://www.who.int/csr/disease/swineflu/en/ 



Fraser, C et al. (2009) Pandemic potential of a strain of influenza A (H1N1) : Early findings. Science 
324:1557-1561. 



WHO (2009) HIV epidemic, http://www.who.int/hiv/en/ 



(2004) HIV/ AIDS: not one epidemic but many. Lancet 364:1-2. 

Anderson, RM, May, RM (2006) Infectious diseases of humans, Dynamics and Control (Oxford Univer- 
sity Press). 

Hethcote, HW (2000) The mathematics of infectious diseases. SIAM Rev. 42:599-653. 

Bansal, S, Grenfell, BT, Meyers, LA (2007) When individual behaviour matters: homogeneous and 
network models in epidemiology. Journal of The Royal Society Interface 4:879-891. 

Diekmann, O, Heesterbeek, JA, Metz, JA (1990) On the definition and the computation of the basic repro- 
duction ratio Rq in models for infectious diseases in heterogeneous populations. J.Math.Biol. 28:365-382. 

Wallinga, J, Teunis, P, Kretzschmar, M (2006) Using data on social contacts to estimate age-specific 
transmission parameters for respiratory-spread infectious agents. Am. J. Epidemiol. 164:936-944. 

Newman, MEJ (2002) Spread of epidemic disease on networks. Physical Review E 66:016128. 

Newman, ME, Strogatz, SH, Watts, DJ (2001) Random graphs with arbitrary degree distributions and 
their applications. Phys.Rev.E.Stat.Nonlin.Soft.Matter Phys. 64:026118. 

Durrett, R (2007) Random graph dynamics (Cambridge University Press). 

Newman, ME (2002) Assortative mixing in networks. Phys.Rev.Lett. 89:208701. 

Boguna, M, Pastor-Satorras, R, Vespignani, A (2003) Absence of epidemic threshold in scale-free net- 
works with degree correlations. Phys.Rev.Lett. 90:028701. 



9 



[15] Newman, ME (2009) Random graphs with clustering. Phys.Rev.Lett. 103:058701. 

[16] Volz, E (2008) SIR dynamics in random networks with heterogeneous connectivity. J.Math.Biol. 56:293- 
310. 

[17] Volz, E, Meyers, LA (2007) Susceptible-infected-recovered epidemics in dynamic contact networks. 
Proc.Biol.Sci. 274:2925-2933. 

[18] House, T, Keeling, M (2009) Unifying modern approaches in network epidemiology, under review. 

[19] Keeling, MJ, Rand, DA, Moms, AJ (1997) Correlation models for childhood epidemics. Proc.Biol.Sci. 
264:1149-1156. 

[20] Keeling, MJ (1999) The effects of local spatial structure on epidemiological invasions. Proc.Biol.Sci. 
266:859-867. 

[21] Eames, KT, Keeling, MJ (2002) Modeling dynamic and network heterogeneities in the spread of sexually 
transmitted diseases. Proc.Natl.Acad.Sci.U.S.A 99:13330-13335. 

[22] Barthelemy, M, Barrat, A, Pastor-Satorras, R, Vespignani, A (2004) Velocity and hierarchical spread of 
epidemic outbreaks in scale-free networks. Phys.Rev.Lett. 92:178701. 

[23] Hollingsworth, TD, Anderson, RM, Fraser, C (2008) HIV-1 transmission, by stage of infection. 
J.Infect.Dis. 198:687-693. 

[24] Wawer, MJ et al. (2005) Rates of HIV-1 transmission per coital act, by stage of HIV-1 infection, in Rakai, 
Uganda. J.Infect.Dis. 191:1403-1409. 

[25] Jacquez, JA, Koopman, JS, Simon, CP, Longini, IMJ (1994) Role of the primary infection in epidemics 
of HIV infection in gay cohorts. J. Acquir. Immune. Defic.Syndr. 7:1169-1184. 

[26] Rapatski, BL, Suppe, F, Yorke, JA (2005) HIV epidemics driven by late disease stage transmission. 
J. Acquir. Immune. Defic. Syndr. 3 8 : 24 1-25 3 . 

[27] Koopman, JS, Simon, CP (2006) Response to Rapatski BL, Suppe F, Yorke JA, HIV epidemics driven by 
late disease stage transmission. J.Acquir.Immune.Defic.Syndr. 41:677. 

[28] Rapatski, BL, Suppe, F, Yorke, JA (2006) Reconciling different infectivity estimates for HIV-1. 
J. Acquir. Immune. Defic. Syndr. 43:25 3-256. 

[29] Abu-Raddad, LJ, Longini, IMJ (2008) No HIV stage is dominant in driving the HIV epidemic in sub- 
Saharan Africa. AIDS 22:1055-1061. 

[30] Xiridou, M, Geskus, R, de Wit, J, Coutinho, R, Kretzschmar, M (2004) Primary HIV infection as source 
of HIV transmission within steady and casual partnerships among homosexual men. AIDS 18:1311-1320. 

[31] Koopman, JS et al. (1997) The role of early HIV infection in the spread of HIV through populations. 
J.Acquir.Immune. Defic. Syndr.Hum.Retrovirol. 14:249-258. 

[32] Kretzschmar, M, Dietz, K (1998) The effect of pair formation and variable infectivity on the spread of an 
infection without recovery. Math.Biosci. 148:83-113. 

[33] Liljeros, F, Edling, CR, Amaral, LAN, Stanley, HE, Aberg, Y (2001) The web of human sexual contacts. 
Nature 411:907-908. 



10 



Schneeberger, A et al. (2004) Scale-free networks and sexually transmitted diseases: a description of 
observed patterns of sexual contacts in Britain and Zimbabwe. Sex Transm Dis 31:380-387. 

Johnson, AM et al. (2001) Sexual behaviour in Britain: partnerships, practices, and HIV risk behaviours. 
Lancet 358:1835-1842. 

Carael, M (2009) Sexual behaviour, in Sexual behaviour and AIDS in the developing world (Taylor & 
Francis, London) No. 4, pp 75-123. 

Chen, MI, Ghani, AC, Edmunds, J (2008) Mind the gap: the role of time between sex with two consecutive 
partners on the transmission dynamics of gonorrhea. Sex Transm.Dis. 35:435-444. 

Morris, M, Kretzschmar, M (1997) Concurrent partnerships and the spread of HIV. AIDS 11:641-648. 

Kretzschmar, M, Morris, M (1996) Measures of concurrency in networks and the spread of infectious 
disease. Math.Biosci. 133:165-195. 

Alonso, D, McKane, AJ, Pascual, M (2007) Stochastic amplification in epidemics. J.R.Soc. Interface 
4:575-582. 

Black, AJ, McKane, AJ, Nunes, A, Parisi, A (2009) Stochastic fluctuations in the susceptible-infective- 
recovered model with distributed infectious periods. Phys.Rev.E.Stat.Nonlin.Soft.Matter Phys. 80:021922. 

Girvan, M, Newman, ME (2002) Community structure in social and biological networks. 
Proc.Natl.Acad.Sci. U.S.A 99:7821-7826. 

Newman, M, Girvan, M (2003) Mixing patterns and community structure, in Statistical Mechanics of 
Complex Networks pp 66-87. 



Wilensky, U (1999) NetLogo. http://ccl.northwestern.edu/netlogo/ Center for Connected Learning and 



Computer-Based Modeling, Northwestern University. Evanston, IL. 

Stonedahl, F, Wilensky, U (2008) NetLogo Virus on a network model. 



http://ccl.northwestern.edu/netlogo/models/VirusonaNetwork Center for Connected Learning and 



Computer-Based Modeling, Northwestern University, Evanston, IL. 

Molloy, M, Reed, B (1995) A critical point for random graphs with a given degree sequence. Random 
Structures and Algorithms 6:161-180. 

Wolfram Research, Inc. (2007) Mathematica Edition: Version 6.0, Champaign, Illinois. 

(2009) Mathematica Tutorial: Numerical solution of partial differential equations. 



http://reference.wolfram.com/mathematica/tutorial/NDSolvePDE.html 



11 



Supporting Information 



The following paragraphs provide a detailed derivation of the equations for the SID and SI^D models in which 
the notation introduced in Tab. [T]is used. 

Equations for the SID model 

We consider an epidemic of a disease that is transmitted at a rate r and from which infected persons die at a rate 
[i. The demographics of the background population is determined by a birth rate r)i and a death rate r] 2 . This 
leads to the following equation for the number of susceptible and infected persons with k (infectious) contacts 

Sk = -rpsikS k + r]iNp k - rj 2 S k +r)ig'(l,t)(S k - 1 - S k ) - (rj 2 + VPsi)(kS k - (k + l)S k+1 ) (8) 

ik = +rpsikSk-(m + v)h + mg(i,t)(h-i-ik)-(v2 + mi)(kik-(k + i)ik+i)- (9) 

In terms of the total number of susceptible and infected persons the equations read 

S = niN - rpsiMs - r) 2 S (10) 
i = rpsiMs-im + ^I- (11) 

To close this set of equations we also have to derive equations for psi and pu as well as for the probability 
generating functions (PGF) g${x, t) and gi(x, t). Following the argument in |[T6l we write 

M S i M s 

PSI = ~M~s ~M~s~ Psi (12) 
M u Mj 

PH = -m7-m-j PI1 (13) 

to derive equations for the number of links among susceptible and infected persons. Noting that the average 
excess degree of a susceptible node that was reached from an infected node to susceptible or infected nodes are 
$si(S) = (1 - Psi)^gj^ and 5 S i(I) = Psi^jj, respectively, one can write 

M s = -rpsiS(g's(l,t)+g's(l,t))+ m g'(l,t)(N + S)-(2 m + ij,psi)Ms (14) 
Mj = =rpsiS(g's(l,t)+g' s (l,t))+r ]1 g'(l,t)I-(2 m + ^ + mi )M I (15) 

M S i = -rpsiM s (5si(I)-5si(S))-(r + fj,)Msi-2r] 2 Msi + riig'(l,t)I (16) 

a" (I t) 

= -r(2ps I -l)psiMs^^-(r + f i + 27 ]2 )M SI + mg'(ht)I (17) 

M n = 2rpsiMsS SI (I) + 2rM S i-2^M n -2rj 2 M n (18) 

= 2rp SI Ms^j^ + 2rMsi-2{fi + m )M II (19) 

and finally 

PSI = rp S i(l-psi)^^-(r + ^Psi(l-psi)+Vi^T^(I-(N + S)psi) (20) 

g s 0-,t) m s 

M s , ^(M) M s M s , . g'(l,t) 

pu = r—psi(2p S i - PW yji ^ ~ r j^Psipn + 2r j^Psi ~ M 1 - Piimi + m Mi ipii-(2\) 
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To close the set of equations we have also to derive the probability generating functions (PGF) according to 

(22) 

(23) 

k V 1 1 J 

which result in 

N 

9s(x,t) = -rp S i(xg' s (x,t) - g' s (l,t)g s (x,t)) +r] 1 — (g(x,t) - g s (x,t)) 

-r/i(l - x)g'(l,t)g s (x,t) + (r? 2 + ^p SJ )(l - x)g' s (x,t) (24) 
S 

gi(x,t) = rp S ij(xg' s (x,t) - g' s (l,t)g I (x,t)) 

- Vl (l - x)g'(l,t) gi (x,t) + (t? 2 + " x)g' I ( y x,t). (25) 

Equations for the SI^D model 

We extend the equations of the SID model to accommodate two infected stages I± and h with transmission 
rates r\ and T2 which cease at rates [i\ and [12, respectively. The initial equations for the number of persons 
with k contacts who are in the classes S, I\ or I2 read accordingly 

Sk = -{nPsi 1 +r2Psi 2 ) k S k + r] 1 Np k -r]2S k + rj 1 g'(l,t)(S k . 1 - S k ) 

~(V2 + V2Psh)(kS k - (k + l)S k+ i) (26) 

hk = +(npsh + r2Psi 2 ) k S k - (r) 2 + m)h k + ma '(l,t)(I lk -i - hk) 

-{m + V2Phh)(khk - (k + (27) 

hk = Hihk ~ (m + fJ-2)hk + r} ig {l,t){hk-i - hk) 

~(m + H2Vhh){khk - (k + l)hk+i)- (28) 

In terms of the total number of susceptible and infected persons the equations read 

5 = m N - {r lPsh + r 2 p Sh )Ms - r] 2 S (29) 
h = (npsh +r2Psi 2 )M s - (m + Hi)h (30) 
I2 = Vih - (m + P-2)h- (31) 

To derive the probabilities pab f° r links from A to point to B we again use 

Mab M a 

pab = 1aI- W/ ab (32) 

with A, B € {S, I\, h}- Analogous to |[T6l and to the case of the SID model we derive that the average excess 
degree of a susceptible node that was reached from an infected node (either of I\ or I2 type) to susceptible 

or infected nodes are 8 S i{h) = PSh^gjjj, $Si(h) = PSh^jjj or $Si(S) = (1 - ps h - Psi 2 )^rjj, 
respectively. With this we can summarize 
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M s = -(r 1 p57 1 +r 2 p 5 / 2 )5(^(l,t)+^(l,t))+7 ?1 g / (l,t)(^ + 5)-(2r ?2 + /i 2 p 5/2 )M s (33) 
M h = (rips/ 1 +r 2 ps/J%s(l 5 *)+5s( 1 >*))+W( 1 >*) / i-( 2 '72 + Mi + M2Pl 1 / 2 )M/ 1 (34) 
M h = mM h + rng'{l, t)I 2 - {2r) 2 + ^ 2 + ^ 2 pi 2 i 2 )M h (35) 

Msi, = -(r lP sh + r 2 ps h )M s {5si(Ii) - 6si(S)) - (n + /Ji)M 57l - 2r) 2 M SIl + (36) 

= -(npsh + r 2 psi 2 )(2psh +PSh ~ l)M s y|^| - (n + mi + 2 m )M Sh + m9' '(l,t)/i(37) 

M s/2 = -(rip 57l + r 2PSh )M s 6 S i(I 2 ) - (r a + /x 2 )M S / 2 + ft M s;i - 2t&M S j 2 + ^'(l, t)J 2 (38) 

= -(ripsJi + ™/ 2 )P5/ 2 M 5 ^4v + MiMs/i - fa + M2 + 2 m )M S i 2 + mo'ih t)h (39) 

M 7l/l = 2(np s/l + r 2 p 572 )M 5 5 5 /(/i) + 2r!M 57l - 2fiiM hh - 2r, 2 M hh (40) 

= 2(np s/l + r 2 Ps/ 2 )P5/ 1 M 5 ^4! + 2r i M s/ 1 - 2(mi + %)M /l7l (41) 

AOiJa = { r lPSh + r 2 psi 2 )M s 5 S i(I 2 ) + r 2 M 5/2 - (mi + ^ 2 )M /l/2 + MiM Tl/l - 2 % Af /l/2 (42) 

= (rips/j + r 2 p S i 2 )psi 2 M s ^^r + r 2 M Sh + pnM hh - (mi + p 2 + 2 m )M hh (43) 

M /2/2 = 2/iiM Jl/2 - 2(/x 2 + r? 2 )M /2/2 (44) 
and write down equations for psh, Psi 2 > Phh> Phh an d Pi 2 i 2 

Psh = (nPSh + r 2Psi 2 ) ( 1 - Psh - Ph ) ^77t4v + ( r iPsh + r 2 psi 2 )psh 

- (n + mi - /"2Ps/ 2 )psi! + 771 ( J i " ( N + ) (45) 

Ms 

Psh = (npsh + r 2Psi 2 )psi 2 + PiPsh - (r 2 + H 2 - H2Psi 2 )psi 2 + Vi 9 (h - (N + S><?/ 2 )(46) 

Ms 

Phh = {nPSh + r2Psi 2 ){2psh - PllIl ']\/[j~ g' (1 t ) ~ ( r lPSh + r 2PSh)jj-Phh + 2ri Jf^P SI i 

g'(l,t) 

- (Mi - IMiPhh )Phh ~ m , n .s Phh (47) 

9i Y I 1 ) l ) 

Phh = (npsh + r 2Psi 2 ) {psi 2 - Phh ) ^ ~ ( riP5/ i + r wsh ) Jj?-Phh + r 2 J^Psh 

+PiPhh ~ A*2(l - Phh)Phh - m 1 j^w Phh (48) 

Mj, . , , <?'(!,*) 

P/2/2 = Ml T7 ( 2 P/i/2 - P/a/a ~ M2 1 - P/ 2 ia )Phh ~ Vl , f , ,s Phh ■ (49) 
Mi 2 9i 2 {l,t) 

Again, to close the set of equations we need to derive the probability generating functions 

g A (x, t) = ( - ^PAk ) x k with A G {5, Ji, J 2 }, (50) 



A' 



14 



to conclude with 

g s ( Xjt ) = inPSh + s r2PSh) (M s g s (x,t) -xSg' s (x,t)) +m ^(g(x,t) - g s (x,t)) 

-771(1 - x)g'(l,t)gs{x,t) + (772 + fi 2 p S i 2 )(l - aOffsfc,*) ( 51 ) 

■ I 4-\ (nPSh +r2PSI 2 ) f.J ( .\ O / / .AN 

S/iOM) = ? (M s g h {x,t)-xSg s {x,t)) 

-771(1 - x)g'(l,t) gil (x,t) + (772 + /j 2 p/ 1 / 2 )(l - x)^ (a;, t) (52) 
gi 2 (x,t) = fi 1 ^-(g h (x,t) -g h {x,t)) 

-771(1 - a;)5'(l,t)3/ 2 (a;,/;) + (772 + fi 2 pi 2 i 2 )(l ~ x)g' l2 (x,t). (53) 
Note that the global quantities can be easily derived from these equations 

Nk = mNpk - mN k - H2hk + W (1, t)(W*-i - Nfc) + 77 2 (/cAf fc - (fc + l)iVfc+i) 

-^{psi 2 {kS k - (k + l)5 fc+ i) + j 2 (fe/ijfc - (fc+ l)/ife+i) +pi 2 i 2 (kl 2 k ~ {k + l)J 2 fc+i))(54) 
AT = (771 - 7? 2 )iV - /x 2 J 2 (55) 
M = 2 Vl g'(l, t)N - 2 V2 g'(l, t)N - 2fi 2 M h (56) 
h 

g(x,t) = ii 2 —{g{x,t) - g h (x,t)) + rn(g(x,t) - g(x,t)) - 771(1 - x)g (l,t)g(x,t) 

S I\ I 

+772(1 - x)g'(x,t) +/j 2 (1 - x)(—psi 2 g' s (x,t) + j-pi^gj^x^) + ^Pi 2 i 2 g' l2 {x,t)). (57) 

Transient contacts 

Transient contacts are considered analogous to ifTTl by swapping of partners at a rate p. This affects the proba- 
bilities for links coming from A nodes to point to B nodes pab via an additional term 

< 58 > 

in the differential equations with A,B£ {S, I, Ii,I 2 }. 
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